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Abstract 

When multiple sources of data need to transmit their rateless coded symbols through a single relay to a common 
destination, a distributed rateless code instead of several separate conventional rateless codes can be employed to 
encode the input symbols to increase the transmission efficiency and flexibility. 

In this paper, we propose distributed rateless codes (DU-rateless) that can provide unequal error protection 
(UEP) for distributed sources with different data block lengths and different importance levels. We analyze our 
proposed DU-rateless code employing And-Or tree analysis technique. Next, we design several sets of optimum 
DU-rateless codes for various setups employing multi-objective genetic algorithms and evaluate their performances. 

3 ' 



(N 



O 



I. Introduction 

Rateless codes HI, O, (3]| are modern and efficient forward error correction (FEC) codes. Each rateless 
■ code is determined by a degree distribution, which is precisely designed to achieve a capacity-approaching 
'""i . performance. 

In distributed data transmission using rateless codes, r data sources need to transmit their rateless 
encoded symbols to a destination through a common relay. In general, r sources may have different data 
t-h ■ block lengths and different data importance levels, which necessitate the design of flexible distributed 
^ ! rateless codes that can provide unequal error protection (UEP) of data for different sources. In this paper, 
j — we propose distributed UEP rateless codes (DU-rateless codes), which are a realization of such codes. 

It has been shown that the efficiency of rateless codes increases as the data block length increases JT), 
ED, 0, flU. Thus, in distributed rateless codes it is advantageous to combine the incoming symbols in 
the intermediate relay, which is equivalent to coding a larger data block. Moreover, by tuning coding 
parameters in each data source and parameters of the relay, UEP property can be provided for different 
^ ■ data sources. The problem in DU-rateless codes is to optimally design different degree distributions for 
> each source and to design relaying parameters to realize the desired UEP property and a minimal error 
^ rate for all data sources. 

Previously, two contributions flU and Q have studied distributed rateless codes. Authors in 01 have 
- - ■ designed distributed LT codes. In the proposed scheme in flU, the relay combines all incoming symbols 
that are coded at r 6 {2,4} sources with the same degree distribution. This coding degree distribution 
is designed such that the degree distribution of the combined symbols at the relay follows an optimum 
degree distribution called Robust-Soliton degree distribution [1J. This algorithm cannot provide UEP for 
different sources and obligates sources to have the same data block lengths. 

In jH, authors have also considered the case where the source nodes have the same data lengths, 
and all source nodes perform the encoding with the same degree distribution. Authors have also studied 
the case where the relay generates final output symbols with another independent degree distribution 
that determines how many symbols should be combined in the relay to generate an output symbol. The 
optimization for relaying and coding parameters has been performed separately in this paper, which may 
result in suboptimal performance. 

In this paper, we take several steps further compared to flU, Q, and propose DU-rateless codes that are 
inspired by UEP rateless codes 0, fl7J. DU-rateless codes are able to provide UEP for different sources 
that may also have various data block lengths. 

The paper is organized as follows. In Section HH we propose DU-rateless codes and analyze these codes 
employing And-Or tree analysis technique [8J. In Section [Till we design and evaluate the performance 
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of several ensembles of DU-rateless codes for different UEP setups by optimizing degree distributions 
for each source and relaying parameters along each other employing the state-of-the-art multi-objective 
genetic algorithms NSGA-II [9|. Finally, Section [IV] concludes the paper. 

II. DU-RATELESS CODES 
Rateless codes can generate a limitless number of output symbols from k input symbols based on a 
degree distribution {fii,fi 2 , • • • ,^k}, where fij is the probability that an output symbol has degree i, 
and J2i=i^i = 1- This probability distribution can also be shown by its generator polynomial Q(x) = 
Y!i=i^i xl - m rateless coding, first an output symbol degree d is randomly chosen from fl(x). Next, d 
input symbols are chosen uniformly at random from k input symbols and are XORed together to generate 
an output encoded symbol. Q(x) is usually finely tuned such that k input symbols can be decoded from 
any ^k collected output symbols at decoder, where 7 is a number slightly larger than one and is called 
coding overhead. 

Rateless decoding process consists of one step: Find an output symbol such that the value of all but 
one of its neighboring input symbols is known. The value of the unknown input symbol is computed by 
a simple XOR. We apply this step until no more such output symbols can be found. 

In DU-rateless coding, each source performs rateless coding with a distinct degree distribution on its 
data block and forwards its output symbols to the relay. For the sake of simplicity in analytical expressions, 
we consider a case with r = 2. Consider a distributed data transmission with two sources S\ and s 2 , and 
data block lengths pk and k, respectively, where < p < 1. s 1 and s 2 encode their input symbols with 
degree distributions Q(x) and <p(x) with the largest degrees B\ and B 2 , respectively, and forward them 
to the relay (see Figure [TJ). Relay R receives output symbols from two sources and performs as follows. 

1) With probabilities p\ and p 2 it relays the first and the second source's output symbol to the destination 
D, respectively. 

2) With probability p 3 = 1 — pi — p 2 it combines two incoming symbols and forwards the combined 
symbol to the destination. 

00-0 

_k__ 
00^0 

Fig. 1. Adopted model for DU-rateless codes, si, S2, R, and D represent distributed sources, relay, and destination, respectively. 

The proposed DU-rateless code ensemble is specified by parameters (pk,k 1 Q(x),ip(x) 1 pi,p 2 ,p3,'y). 
DU-rateless decoding is the same as rateless decoding. The decoding is successful when (1 + p)^k output 
symbols are received at the destination. 

Following [3], we may view the input and output symbols as vertices of a bipartite graph C7, where 
the input symbols are the variable nodes and the output symbols are the check nodes. Without loss of 
generality, throughout this paper we may assume that the symbols are binary symbols for simplicity. 

In DU-rateless coding described above, the corresponding bipartite graph at the receiver has two types 
of variable nodes (input symbols from si and s 2 ), and three types of check nodes generated by the relay 
as fepteeffffi^Fifure ^ ^ nodes in the first group are generated based on Vt(x) and are only connected 
to input symbols of s\. Similarly, the check nodes in the second group are generated based on (p(x) and 
are only connected to input symbols of s 2 . Finally, the check nodes in the third group are generated 
using input symbols from both s\ and s 2 with a degree distribution equal to Vt{x) x Lp(x) Q. It is worth 
noting that a check node belongs to the first, second, and third group with probabilities pi,p 2 , and ^3, 
respectively. 

To investigate the recovery probability of an input symbol in DU-rateless codes, we first extend And-Or 
tree analysis (8]|, H technique to fit our problem. Then, we map decoding of DU-rateless codes to 
extended And-Or tree analysis, and evaluate the recovery probability of input symbols. 
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Fig. 2. The bi-partite graph representing input and output symbols for r = 2. 



A. And-Or Tree Analysis Technique 

Consider two And-Or trees (81 T) i and 7} 2 with depth 2L Assume that TJ i and 7] >2 have Type-X 
and Type-Y OR- nodes and Type-I, Type-II, and Type-Ill AND-nodes. For each tree, the root of the tree 
is at depth 0, its children are at depth 1, their children at depth 2, and so forth. Each node at depth 
0, 2, 4, . . . , 21 — 2 is an OR-node (and it evaluates logical OR operation on the value of its children), and 
each node at depth 1, 3, 5, . . . , 21 — 1 is called an AND-node (and it evaluates logical AND operation on 
the value of its children). The root of TJ i is a Type-X OR-node, and the root of 7] 2 is a Type-Y OR-node 
as depicted in Figures [3] and respectively. 




Fig. 3. Ti t i And-Or tree with two types of OR-nodes and three types of AND-nodes with a Type-X OR-node root. 

We assume that in both T t>1 and 7J 2 , Type-X OR-nodes choose i £ {0, . . . , A{\ and j £ {0, . . . , A t } 
children from Type-I and Type-Ill AND-nodes with probabilities 5 it i and 8j i, respectively. Furthermore, 
Type-Y OR-nodes choose i £ {0, . . . , A 2 } and j £ {0,...,A 2 } children from Type-II and Type-Ill 
AND-nodes with probabilities <5 ij2 and <5j j2 , respectively. 

Further, Type-I AND-nodes choose i £ {0, . . . , Bi — 1} children from Type-X OR-nodes with probability 
(3i i, and Type-II AND-nodes choose i £ {0, . . . , B 2 — 1} children from Type-Y OR-nodes with probability 

Moreover, in 7} l5 Type-Hi AND-nodes choose j £ {0, . . . , B± — 1} and i £ {1, . . . , B 2 } children from 
Type-X and Type-Y OR-nodes with probabilities /^i and (3 i)3 , respectively. Note that Type-Ill AND-nodes 
in Ti i should have at least one child from Type-Y OR-nodes, since otherwise it is a Type-I AND-node. In 
addition, in TJ i2 , Type-Ill AND-nodes can choose j £ {0, . . . , B 2 — 1} and % £ {1, . . . , B{\ children from 
Type-Y and Type-X OR-nodes with probabilities (3j j2 and /3j 4 , respectively. Similar to Type-Ill AND- 
nodes in T; i, Type-Ill AND-nodes in 7] 2 need to have at least one child from Type-X OR-nodes to be 



Fig. 4. Ti t 2 And-Or tree with two types of OR-nodes and three types of AND-nodes with a Type-Y OR-node root. 



distinguished from Type-II AND-nodes. 

Finally, we assume that in both 7] i and T} 2 the ratio of the number of AND-nodes of Type-I, Type-II, 
and Type-Ill is p\, p 2 , and p 3 = 1 — p\ — p 2 , where < pi < 1, VI 

Type-X and Type-Y OR-nodes at depth 21 are independently assigned a value of with probabilities 
?/o,i and 2/0,2, respectively. Also OR-nodes with no children are assumed to have a value 0, whereas 
AND-nodes with no children are assumed to have a value 1. We are interested in finding yn and y^ 2 , 
the probabilities that the root nodes of TJ 1 and T L2 evaluate to 0, respectively, if we treat the trees as a 
Boolean circuits. Lemma \T\ formulates y^\ and yip. 

Lemma 1: Let y til and y ij2 be the probabilities that the roots of And-Or trees T^i and T/ >2 evaluate to 
0, respectively. Then / Si-i 

via = 5i 1 1 — pi E - y1-1.1T- 

B t +B 2 -2 d \ 

P3 E E^ 1 - 2/i-i,i) J '^-i+i, 3 (l - m-ia) d - 3+1 ] , 

d=0 '=° ' (1) 

/ B2-1 

y/,2 = s 2 1 1 - p 2 E PwO- ~ yi-^Y- 

V i=0 

B1+S2-2 d \ 

E E^(l - W-i, a ) J '^- i+ i,4(l - 2/ ; -ia) d - J+1 ] , 

ci=0 j"=0 / 

with 5 1 (x) = 5 2 (x) = £^, 2 *\ K = 3^, p& = ±™ = t^, P 2 = ^ and j/ 4 = 

i=0 i=0 

1-P1-P2 _ P3 

1— pi 1— PI ■ 

Proof: The proof is straight forward and similar to the proof of [7, Lemma 2], and is not included 
in this paper due to space limit. ■ 
The relation between the above analysis and the error probabilities for DU-rateless codes is given in 
the following. 



B. Analysis of DU-rateless Codes 

In this section, we examine the DU-rateless codes under iterative decoding. Let G denote the bipartite 
graph corresponding to a DU-rateless code at the receiver. In 0, J3), 0, 10, authors have shown that 
iterative belief propagation decoding of rateless codes can be rephrased as following. At every iteration 
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of the algorithm, messages (0 or 1) are sent along the edges from check nodes to variable nodes, and 
then from variable nodes to check nodes. 

A variable node sends to an adjacent check node if and only if its value is not recovered yet. Similarly, 
a check node sends to an adjacent variable node if and only if it is not able to recover the value of 
the variable node. In other words, a variable node sends 1 to a neighboring check node if only if it has 
received at least one message with value 1 from its other neighboring check nodes. Also a check node 
sends to a neighboring variable node if only if it has received at least one message with value from 
its other neighboring variable nodes. Therefore, we see that variable nodes indeed do the logical OR 
operation, and the check nodes do the logical AND operation. 

Consequently, we can use the results of Lemma [T] on a subgraph Gi x of G to find the probability that 
a S\ variable node is not recovered after I decoding iterations (its value evaluates to zero). We choose 
Gi x as following. Choose an edge (v, w) uniformly at random from all edges in G with one end among 
variable nodes of si. Call the variable node v the root of G^i. Subgraph G^i is the graph induced by v 
and all neighbors of v within distance 21 after removing the edge (v,w). It can be shown that G^i is a 
tree asymptotically (81. We can map encoded symbols from si, encoded symbols from s 2 , and combined 
encoded symbols in Gi to Type-I, Type-II, and Type-Ill AND-nodes in 7] 1? respectively. Further, variable 
nodes of si and s 2 in Gi t i can be mapped to Type-X and Type-Y OR-nodes in TJ i, respectively. 

In the same way, to find the probability that a s 2 variable node is not recovered after I decoding 
iterations, we choose a subgraph Gi )2 of G similar to G^i except that we choose the edge (v, w) such 
that it has an end among variable nodes of s 2 . G7 2 can be mapped to 7] 2 in the same way that G11 is 
mapped to 7}^. 

To complete DU-rateless codes analysis, we only need to compute the probabilities /^i, /3j j2 , A,3> A,4> 
and functions 5i(x) and 5 2 (x), which are given in the following Lemma. 

Lemma 2: Consider trees TJ 1 and I} )2 that are derived based on a (pk, k, £l(x), ip(x),pi,p 2 ,P3, 7) DU- 
rateless code graph G. The probabilities /3j ):L , f3 i>2 , A,3» A,4> an d functions 8i(x) and 5 2 (x) are given 
as 

e (l- P2 )A'i7 ii 7 £i (2;-l) j( 5 2 (- a ,) = e (l- P i)n2i{l+p)(x-l) ^ 

(i + _ (i + 1)^+1 

ft'(l) ,Pl ' 2 ~ cp'(l) ' 
<Pi, and ^,4 = Q{, 

where pi = O'(l) and p 2 = are the average degrees of the two coding degree distributions Vt{x) 
and <p(x). 

Proof: We have /^i is the probability that a randomly chosen edge in 7} i is connected to a Type-I 
or a Type-Ill AND-node with % children among Type-X OR-nodes. This is the probability that the edge 
is connected to a Type-I or Type-Ill AND-node of degree % + 1 (excluding edges connected to Type-Y 
OR-nodes from Type-Ill AND-nodes). It can be seen that out of ^pkVt'il) total edges connected to Type-I 
and Type-Ill AND-nodes from Type-X OR-nodes, ^pkii + l)Vt i+ i edges are connected to AND-nodes of 
degree % + 1. Therefore, we have (3 itl = ■ Similarly, we have (3 ii2 = ■ Moreover, /3 i)3 is 

the probability the a randomly chosen edge in 2] j is connected to a Type-Ill AND-node with i children 
in the Type-Y OR- node. This simply gives ^3 = tfi. In the same way, ^4 = Oj. 

We have ^1 is the probability that the variable node connected to a randomly selected edge has degree 
i + 1 given that the variable node belongs to Type-X OR-nodes. The total number of edges connected 
to Type-X OR-nodes is pi^k(l — p 2 ) ^J 3 ' out of which (i + l)X i+ i t ik edges are connected to OR-nodes 
of degree i + 1, where A i+ i i is the probability that a variable node of si has degree i + 1. We observe 
that A d ,i = ( (1 ~ P2) ^ 7fe(1+p) )('^) d (l - ^)( 1 -P2)^i7fc(i+p)-rf since ^(1 + p ) 1 k{l-p 2 ) edges are connected 
uniformly at random to si's variable nodes. Therefore, we have ^1 = — (»+ 1 ) A »+i,i _ ^fter substitution, 
we have d^x) = ^-p^n^i^) _ similarly, we have 8 2 {x) = e (^pi)^i(i+ P )(x-i) _ u 



6i(x) = 
Pi,i = 

A,3 = 
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Lemma Q] and Lemma [2] give two sequences {yi,i}i and {|/z, 2 };, which are decreasing and convergent 
with respect to the number of decoding iterations, / 0, (61. Let BERx and BER 2 denote the corresponding 
fixed points. These fixed points are the probabilities that Type-X and Type-Y OR-nodes are not recovered 
after / decoding iterations. In other words, these fixed points are the final decoding error rates of a 

(pk, k, Q(x),(p(x),pi,p2,P3,y) DU-rateless code. 

III. DU-RATELESS CODES DESIGN 

In this section, we employ our analytical results in the previous section to design optimal DU-rateless 
code parameters for different setups. For DU-rateless coding with r = 2, two error rates BERx and BER 2 
are defined. The values of these two error rates are dependant, i.e. improving one error rate by modifying 
DU-rateless code parameters may result in degrading the other error rate. In other words, we are dealing 
with two dependant error rates. Consequently, if we consider error rates as conflicting objective functions, 
we have a multi-objective optimization problem. 

Since we have more than one objective functions to minimize, we need to employ pareto optimality 
concept. In Figure [51 we have depicted a simple minimization problem with two conflicting objective 
functions and two decision variables. Assume that shaded area in decision space is mapped to the shaded 
area in objective space. We can observe that, three sets of variables shown on the decision space result in 
Fx's and F 2 's that no other decision variables can concurrently surpass. These solution are called pareto 
optimal or non-dominated solutions, and their mapping to objective space is called pareto front. We can 
observe that in contrast to single objective optimizations, we can have infinite number of optimum decision 
variables. 

X! Pareto-optimal solutions F[(Xj,x 2 ) 




x 2 Fj(x,,X 2 ) 
Decision space Objective space 



Fig. 5. Concept of pareto optimality, pareto front, and domination for a two-objective minimization problem with two decision variables, 
xi and X2- 

Multi-objective optimization methods such as NSGA-II \9\ search to find solutions that result in pareto 
front. 

A. DU-rateless Codes Design Employing NSGA-II 

We fix the parameters 7 = 1.05 and Bi = B 2 = 100, and employ the state-of-the-art multi-objective 
genetic algorithm NSGA-II |[9l to find the optimum value for Q(x) and (p(x) along with relaying parameters 
pi, P2, and P3 that minimize BERi and BER 2 for various values of rj = |||^ and p E {0.3,0.5, 1}. In 
other words, we have a problem including two objective functions, BERx and BER 2 , with 202 independent 
decision variables, i.e. x = {fix, ^2, ^100, <Pi,<P2,---, ViocPi,^}- 

The resulting pareto fronts for p E {0.3, 1} are depicted in Figure [6] in objective space. 

Note that each point in Figure [6] embodies two degree distributions and three relaying parameters, and 
none of these points dominate another member in the pareto front. One should choose an appropriate 
point according to a desired rj (UEP parameter), and employ the corresponding DU-rateless code. Since 
optimization results cannot be reported in the paper due to huge number of members, they are made 
available online at |[T0ll forpG{0.3,0.5,l}. Note that 77 = 1 corresponds to equal error protection (EEP) 
case where data from s 1 and s 2 are equally protected. 



7 




B. Performance Evaluation of Designed Codes 

From the sets of optimized degree distributions available at [UOl . we choose DU-rateless codes for 
r] E {10,100} and p = 1 and evaluate their performance in Figure |7J For comparison, we have also 
provided BER X and BER 2 for EEP case (rj = 1). 
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y i 
(a) BERs for rj = 10. (b) BERs for 77 = 100. 

Fig. 7. The resulting BERs with optimized sets of parameters for rj £ {10, 100}, 7 = 1.05, and p = 1. 

Figure [7] illustrates that the expected UEP property is fulfilled for 7 = 1.05 with the minimal values of 
BER\ and BER 2 . The parameters of a DU-rateless code for p = 1 and 77 = 10 are given as pi = 0.4822, 
p 2 = 0.1173, p 3 = 0.4005, 

n(a:) = 0.039a; 1 + 0.492s 2 + 0.094a; 3 + 0.09a; 4 + 0.096a; 5 

+ 0.002a; 6 + 0.055a; 7 + 0.019a; 8 + 0.033a; 9 + 0.014a; 10 
+ 0.004a; 20 + 0.006a; 27 + 0.005a; 31 + 0.005a; 43 + 0.005a; 78 
+ 0.005a; 86 + 0.014a; 95 + 0.007a- 100 , 

and 

<p(x) = 0.072a; 1 + 0.48a; 2 + 0.055a; 3 + 0.051a; 4 + 0.063a; 5 

+ 0.059a; 6 + 0.037a; 7 + 0.026a; 8 + 0.025a; 9 + 0.036a; 10 
+ 0.005a- 15 + 0.003a; 28 + 0.005a; 37 + 0.002a- 44 + 0.002a; 70 
+ 0.002a- 77 + 0.003a; 83 + 0.004a; 93 + 0.052a- 95 + 0.002a; 97 , 

We can see that to achieve an optimum distributed coding 40.05% of generated symbols should be 
combined in the relay. The performance of this code is illustrated in Figure |7(a) 
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We emphasis that the interesting point of our approach is optimizing all codes' parameters along each 
other using multi-objective genetic algorithms. Note that conventional linear programming optimization 
methods may not be able to optimize all parameters of our code at the same time resulting in suboptimal 
code design. 

Further, finding a general analytical expression for r > 2 is the next step in our future research. 

IV. Conclusion 

In this paper, we proposed distributed rateless codes with unequal error protection (UEP) property. 
Besides providing UEP property, DU-rateless codes do not need that all sources to have equal data block 
lengths. First, we analyzed DU-rateless codes employing And-Or tree analysis technique, and then we 
designed several optimum sets of DU-rateless codes using multi-objective genetic algorithms. Finally, 
we evaluated the designed codes employing simulation results. Simulation results show that DU-rateless 
codes fulfill the expected UEP property with minimal error rates. 
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